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Abstract 



Wc define a semantic complexity class based on the model of quan- 
tum computing with just one pure qubit (as introduced by Knill & 
Laflamme) and discuss its computational power in terms of the prob- 
lem of estimating the trace of a large unitary matrix. We show that this 
problem is complete for the complexity class, and derive some further 
fundamental features of the class. We conclude with a discussion of 
some associated open conjectures and new oracle separations between 
classes. 



1 Introduction 

The idea of doing quantum computation with just one pure qubit 
is relevant to certain problems in the theory of quantum chaos [5] , 
and possibly relevant in terms of choosing an architecture to 
pioneer practical quantum computation, since at present there 
would seem to be little agreement about which technology is 
most ht for implementing a general purpose quantum proces- 
sor, pQ. Yet the literature seems to lack a discussion of the 
complexity classes naturally associated to the one-pure-qubit 
paradigm. Taking a computer science perspective, we will de- 
fine a class analogous to BQP using the circuit model, discuss 
some basic conjectures about that class, provide a definition of 
relativisation for the model, and show some evidence for the con- 
jectures in terms of that relativisation. It is hoped that this will 
help promote the search for algorithms in other paradigms. 

DQC1 as a computational model, ( "Deterministic^] Quantum 
Computation - 1 pure qubit" , as defined in [5] ) is an apparently 
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less powerful computational idea than 'full' quantum computa- 
tion, yet it appears to be able to do certain things exponentially 
more quickly than can a classical device. In particular, the origi- 
nal paper [5] explains something of the relation to the classically 
hard problem of trace estimation. In this article, we use the idea 
of computational circuits to derive a proper complexity class for 
the DQC1 model, in order to ask about the languages that can 
be resolved with bounded probability in polynomial time, and 
we address conjectures concerning the ability of families of such 
circuits to resolve some of the 'classical' languages belonging to 
classes such as P and BPP. 

There is a nice description of general quantum circuits in [3], 
where it is shown how circuits can naturally be composed when- 
ever the output of one circuit is of the same "type or kind" as 
the input to the next. The authors also point out that it can 
sometimes be useful to have the elements of a quantum circuit 
depend on a classical string that has the status of 'input', to 
allow for notions of adaptation. Both of these themes are key 
to the development of a computational class that expresses the 
power of the DQC1 model. 

This first section recaps some basic notation used ubiqui- 
tously within Quantum Information Science, recaps some def- 
initions of circuits as used in context of computational com- 
plexity theory, and introduces a new complexity class for cap- 
turing the power of the DQC1 methodology within the context 
in which we're interested. Section [2] illustrates in detail a well- 
known complete problem for the "one pure qubit" approach, and 
shows how our complexity class is relevant to that problem. Sec- 
tion looks in more detail at the new class, proving some basic 
properties about it and showing that it is closed under certain 
kinds of reduction. Section 0] makes some conjectures about how 
our complexity class stands in relation to other common classes, 
analysing the strength of the conjectures with regard to known 
hard challenges of complexity theory, and providing 'evidence' 
for believing the conjectures despite an absence of formal proof. 



Algebra Notation 

We use the algebra formally defined by 

<D[ X i} Zi } w , 
( X i —1,2? — 1) (Xu Zi), [Xi, Xj], [Xi, Zj] ).^. ' 

where w is a positive integer, the width (in qubits) of a quantum 
circuit. This algebra is nothing other than the ring of square (2 W - 
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by-2 w ) complex matrices, but it is useful to be able to refer to the 
algebra without having to speak explicitly of matrices. On it we 
can define Trace (TV) and adjoint (f ) in the usual manner, these 
having the properties that one would expect of a matrix algebra. 
(Note that the value of Tr[l] will depend on which A w the 1 is 
taken from.) X and Z denote the usual Pauli operators. The 
unitary transformations of quantum physics correspond with the 
automorphisms p i— ► U ■ p • LP, for unitary U. 

The DQC1 model is characterised by requiring the starting 
state of any computation to be highly mixed except for one pure 
qubit : the starting state is given as the (Hermitian) density 
operator 

1 + Zi 

P start : = 2W G Aw- ( 2 ) 

The formalism must expressly forbid any non-unitary gates, 
since the use of 'zeroise' gates, for example, will readily enable 
one to purify more qubits and then perform a computation on 
many pure qubits. Instead, the only allowed actions on the den- 
sity operator (within DQC1,) besides a final measurement, are 
conjugations by unitary operators (i.e. inner automorphisms.) 
Thus the states (density operators) reachable from the starting 
state using only these allowed automorphisms have the form 



1 + UZjrf l + PZi + y/l- p 2 R 

P 2«j ' 2 W ' 

where R and Z\R are traceless, and f3 is real. This format for 
writing a generic state is chosen so as to highlight the coefficient 
in front of the Z\ term, which we will later define as the 'output' 
of a circuit whose gates implelemt the unitary U. That (3 is real 
can be seen from its definition as j3 := Tr(U Z^W Z\) /2 W and 
the fact that U Z\XJ^Z\ is a product of two hermitian operatorsJl 
That R and Z\R are traceless can be seen from the definition 
R := uz ^' < ~I 3 ^l_ ("We leave R undefined in the cases where 

/3 2 = 1.) 

We think of R as coding the state of the 'workspace' of a 
DQC1 algorithmic process, and (3 as coding the 'output' of the 
process. The final measurement, to be applied to the state af- 
ter the completion of the unitary circuit, will measure the first 
qubit in the computational basis. The probability of obtaining 
"0" from this reading for the state described in equation © is 



2 (AS) 1 " = BU 1 = BA; Tr[BA] = Tr[AB\. 
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therefore given by 
F("0") 



Tr 



P- 



l + Zx 



1 + 13 



(4) 



Entanglement 

It is worth mentioning here something about the entanglement 
of states of the form of equation ([3]). While we won't be consid- 
ering any particular applications of entanglement, we note that 
entropy considerations alone guarantee that, for any reasonable 
measure, there won't be much entanglement present in such a 
system. But it would be inaccurate to say that such states are 
necessarily free of all quantum correlations. An example of an 
entangled state would be 

PEnt := ( 2 + X X X 2 - Y X Y 2 + Z x - Z 2 ), (5) 

which is prepared from the state p s t a rt by the unitary operator 

U = ~{l + Z 1 +X 2 -Z 1 X 2 )-~{l-Z 2 + H 1 + H 1 Z 2 ). (6) 

To see that it is entangled, note that pEnt satisfies 

Tr[p-(1-Z 1 )(l + Z 2 )] = 0, (7) 
Tr[p-{X 1 + iY 1 )(X 2 + iY 2 ) ] ± 0. (8) 

However, any pure product state 

_L(1 + aiXl + fcYx + 7l Zx) • (1 + a 2 X 2 + {3 2 Y 2 + 72 Z 2 ) ■■■ (9) 

either satisfies equality © and then fails the inequality ([8|), or 
else satisfies the inequality ([8]) but then fails the equality ([7|) by 
contributing a strictly positive (real) amount. Therefore pE nt 
cannot be a convex combination of pure product states. 



Circuits in general 

Here we recap a few basic definitions for circuits and computation 
via circuits. A classical deterministic circuit is usually defined 
(e.g. |2j) as a directed acyclic graph whose vertices are either 
'inputs' or 'constants' (with no indegree), 'NOT gates' (with in- 
degree 1), or 'AND' or 'OR' gates (with indegree 2). The vertices 
with no outdegree are called 'outputs'. The language obtained 
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from an infinite family {C\, C2, •••} of such circuits, where circuit 
C n has n inputs and one output vertex, is the set of strings 

C = {xG{0,l}* : a w (x) = l}. (10) 

Here C n (x) denotes the output of circuit C n when the gates 
are evaluated after input string x (of length n) is used to load 
boolean values into the 'input' vertices. Then whenever the 
family is described by a logspace Turing machine, we have that 
£gP, Such families are called uniform, and are always polyno- 
mially bounded [2]. 

There are many ways to introduce randomness to this model. 
For example, one may allow for extra vertices of zero indegree 
that are to be initialised randomly, or one may incorporate 'coin- 
flip' vertices with indegree 1 that 'overwrite' data with random 
data. Within models which allow for some randomness, we rede- 
fine C n {x) to be the probability of the output vertex evaluating 
to "0". If there's a guarantee that C\ x \(x) (|, |), for all strings 
x, then a language in BPP may be derived. This class is a se- 
mantic class since it depends on a guarantee. There is nothing 
special about the (^,|) limitation in that guarantee, since any 
non-negligible separation in the probabilities can be amplified by 
repeated computation [2j. 

Quantum circuits are usually defined similarly [1] with uni- 
tary gates replacing ordinary vertices, and measurement gates 
relpacing output vertices. Unitary gates have the same indegree 
as outdegree, because they are reversible. Measurements are usu- 
ally taken to be single-qubit measurements in the computational 
basis, (see pQ for more details.) Thus is the class BQP derived. 
As well as being aptly notated pictorially by a directed acyclic 
graph, a quantum circuit can equally well be notated simply as 
a sequence of the unitary operators applied, which is to say we 
can describe it by listing a series of unitary elements from A w , 
to be applied sequentially to some starting input. Without loss 
of generality, all measurement can be delayed to the end of the 
quantum circuit, and since only one bit of output is required for 
decision problems, that output measurement can then be a mea- 
surement of Z on the first qubit, with other qubits being traced 
out. 

DQC1 Circuits 

For DQC1 circuits we approach the problem slightly differently, 
because we need a different notion of 'circuit input' to satisfy 
the DQC1 framework of allowable states. Because the actual 
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quantum input into the circuit must necessarily be of the form 
prescribed by equation ([2]) for DQC1 circuits, we need an alter- 
native way of getting the input string x into the computation. 
So we recycle the idea of a circuit being a list of unitary oper- 
ators from Aw, but each element of the list will be a choice of 
two unitary operators, the actual one to apply being selected 
by a (classical) bit of x. Thus an example of a description of a 
small DQC1 circuit would be {A\B) Xl ■ (C\D) X2 ■ (E\F) X1 . This 
circuit, on input x = "01", would perform the automorphism 
specified by conjugation by the unitary A ■ D ■ E on the starting 
state, but on input x ="11" would conjugate the starting state 
by B ■ D ■ F, &c. In general, we allow a polynomially bounded 
circuit width, w, so A, B, C, D, E, F must be unitary elements of 
A w . (Note that this definition of input could equally well have 
done for the classes described above, though it is not commonly 
employed.) 

The circuit output probability C n (x) is then taken to be (1 + 
f3) i '2 as per equations © and (H]), where (3 is found from the state 
resulting from the starting state (equation ([2])) being subject to 
the specified sequence of automorphisms. As before, C n (x) has 
the physical meaning of being the probability of measuring the 
first qubit of the final state to be | ) . (An example of a problem 
that fits this paradigm - indeed the canonical example - is that 
of estimating the trace of a unitary matrix. The input to the 
problem will amount to a classical description of the matrix. 
The details are described below in section [2j) 

Thus is the class BQ1P derivecll, assuming as before that the 
family of circuits is uniform. The precise rule that we adopt for 
defining BQ1P languages interprets probabilities a little more 
generally too; instead of requiring that CLi (x) (|, |) we simply 
require the (possibly) weaker condition that for n = \x\ 



for some known polynomially bounded complexity function q(n). 
The reason for allowing polynomially small error bounds is that 
there is no apparent way to 'chain together' (sequentially com- 
pose) or even parallel compose DQC1 circuits, and so unlike the 
BPP or BQP cases, amplification of probability has to take 
place outside of the model. And so any language in BQ1P must 

3 This is our notation, "Bounded-error Quantum - 1 pure qubit - Polynomial time 
computation" , but we welcome correction from those who know better how to name new 
classes. 




(11) 
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satisfy 



C := |x£{0,l}« : D(x) > -L^ |, (12) 
£ := {,€{0,1}' : «,) < 

The class is reasonably robust against variation in the exact 
specification of which unitaries are allowed within the circuit. 
Since the circuit families must be uniform (i.e. describable by 
a well-behaved logspace Turing machine,) it will be important 
to avoid encoding any 'complexity' in the alphabet of allowable 
unitaries. In general we will take a finite alphabet of gates and 
then work with its closure under conjugation by SWAP gates. 
The resulting alphabet (subset of A w ) will be of polynomially 
bounded size, and we shall want for it to form a universal set 
in the sense of being able to generate any unitary to arbitrary 
precision as w — > oo. An example of a such an alphabet would be 
the set of all CNot unitaries 1+z o +x k z i x k together with the set 

of all Hadamards an d all £ tt/8' unitaries exp(iZj7r/8). 

Robustness of classes with respect to changes in the defining 
gate-set is discussed amply in the literature (e.g. [1].) 

2 Completeness 

Although BQ1P is a semantic class, we can still provide a notion 
of completeness for it, by reference to the problem of estimating 
the (sign of the real part of the) trace of a unitary operator. 
This problem is discussed in [5], and we recap and expand on 
the main ideas here. 

Formally, the promise problem for Trace Estimation is : given 
a description of a unitary matrix in factored form (see below), 
decide whether the sign of the real part of the trace of the matrix 
is positive, given the promise that the magnitude of the real part 
of the trace is polynomially bounded away from zero. Specifi- 
cally, this means that if x is the description of the unitary U, 
and U belongs to the algebra A w , then the promise holds that 
| Re[Tr[U]] | > p ?\ x \\ for some polynomially bounded complexity 
function p. The dependence on w serves to make the problem 
well-defined, essentially remedying the fact that Trace applies 
a linear multiplier of 2 W . It may be noted [5] that for a ran- 
domly chosen U without any such promise, the typical value of 
| i?e[Tr[£/]] | is -\/2™, and that in any case the value is constrained 
to the region [0, 2 W ]. 
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Let {L W) k) w ,k be some sensibly chosen uniform family of maps 
that interpret binary strings of length k into unitary elements of 
A w ; (the details are unimportant.) Let |x| = kt and let 

U(x,w,k) = L W:k (x 1 ...x k )---L W:k (x kt -k+i---x kt ), (13) 

denote the unitary matrix that is obtained by parsing the string 
x in sections of k bits, using L w>k , and composing the resulting 
network of t gates (each acting on w qubits.) 

Lemma 1 The promise problem for Trace Estimation is com- 
plete (using classical logspace reductions) for BQ1P. 

To show completeness of the promise version of the Trace Es- 
timation problem for BQ1P we need to do two things : first 
we shall show that this version can be solved within the BQ1P 
methodology, and second we shall show that any other method- 
ology solving the promise problem can simulate the action of any 
other BQ1P computation. This will not however help us write 
down any particular language in BQ1P, because the promise in 
question cannot be readily encoded in the usual linguistic syntax. 
The notion of reduction that we use in reducing other problems 
to this Trace Estimation problem will be reduction by classical 
processing bound to logarithmic space [2j; see lemma|3]in section 
[3] for more details. 

Proof part 1) (c/ [5J) For the first part of the proof, we 
need a uniform family of DQC1 circuits such that circuit number 
(\x\,w,k), of width w' ', on input x, p start; wm generate a state for 
which j3{x) > l/q(\x\) if and only if Re[Tr[U(x, w, k)]] > 0, etc. 
To achieve this, we take w' = w + 1 and use a DQC1 circuit that 
begins and ends with a Hadamard gate on the first qubit, and 
in between these performs the relevant conditional unitaries on 
the remaining qubits, controlled also on the (quantum) setting 
of the first qubit to the state | 1 ). (The factors in equation (|13p 
can be broken down in a uniform wa)Q into choices of unitaries 
controlled by the classical bits of x, and for any fixed alphabet, it 
is a finite problem to break down controlled versions of gates into 
simpler forms allowed in the alphabet, with arbitrary accuracy.) 
Writing this out for any specific input x gives 

H 1 -K x {U{x : w,k))-H 1 . (14) 

This circuit, on input p s tart> wm yield the A w +i state 

-i . U(x,W,k) + lf' ! (x,W,k) ry ■ U(x,w, k) — f/t (x,W,k) -tr 

1 + 2 -Zl - I- 2 -Yl 

% cf section [3] 
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for which the (3 of equation flU) is Re[ Tr[ U(x, w, k) G A w }/2 w }. 
Invoking the promise, we can be sure that this j3 satisfies \f3\ > 
p Q x ^ , and so by taking q > p, we can ensure that the correct 
decision is made according to equation (|12p . even allowing room 
for small errors in any approximations that might be required 
when rendering equation (|14|) . 

Proof part 2) For the second part of the completeness proof, 
suppose we have some 'black box' means for deciding of the triple 
(x,w,k) whether the real part of the trace of U(x,w,k) is pos- 
itive, given that it is bounded away from zero. We shall use 
this black box, together with what amounts to merely logspace 
processing, to decide an arbitrary BQ1P language. The use of 
logspace processing will be 'justified' later in section El 

An arbitrary BQ1P language that we would have our black 
box decide may be written as 



(y, w, k) : Tr 



U(y, w, k) ■ Zf U\y, w, k) ■ Z x 



1 

> 



Q(\y\ 



(16) 

as derived from equations ([3]), Q, and (|12p . Then all we need 
do is to have x code for a related circuit of the same width and 
similar depth, so that U (x, w,k) = U (y, w, k) ■ Z\ ■ U^y, w, k) ■ Z\ 
and \x\ = 0(\y\); for then the black box will be able to decide on 
(y, w, k) via x. This will be a simple matter for any reasonably 
chosen L w ^- ■ 



3 Properties of DQC1 Circuits 

This section gathers together some important properties of the 
circuits we've been discussing, and of BQ1P itself. 



Boolean formulas on inputs 

Observe that the following small DQC1 circuit will 'compute' 
the boolean operation x\ A X2, deterministically, using a circuit 
of total width w > 1 : 

(1X^.(1X^.(1^!)^. (17) 

More precisely, this will map the starting state to (1 + (3Zi)/2 w 
where (3 = (_i) :r i A:c 2_ Likewise, other simple boolean gates may 
be rendered. Note that it is necessary to use one of the bits 
twice (bit x\ in this example) in order to perform a non-trivial 
gate. In general, it would be exceptionally limiting if we were 
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only allowed to use each input bit once, because after each input 
bit had been used, no further automorphisms would be able to 
change any of the algebraic relations held between the states 
arising from different x values. 

Does the ability to perform these simple boolean gates mean 
that it is straightforward to put P C BQ1P? Well, no, be- 
cause there is no method prescribed to take the output of one 
DQC1 circuit directly into the input of another DQC1 circuit, 
and therefore it is not enough to be able to perform interesting 
gates only on input bits, one must also be able to perform in- 
teresting gates on intermediate bits (whatever they might be.) 
Recall from equation Q that there is a somewhat limited 'place' 
in which 'intermediate data' can be held during a DQC1 com- 
putation. 

Complement 

It is obvious from the definitions that co-BQIP = BQ1P, and 
so we shall say no more about this, except to point out that 
the sign of (3 in equation ([3]) can be flipped at any time of our 
choosing by use of the unitary X\. 

0LC BQ1P 

The well known complexity class ®L, (see e.g. [3],) may be 
defined as comprising those languages that can be decided by 
a uniform family of circuits composed entirely of CNot gates, 
whereby the |x|th circuit decides on the string x by acting on 
the pure state | x ) prior to a (deterministic) measurement of 
the first qubit in the computational basis. Equivalently, using 
our idea of "input as classical control" , we could define (J) L by 
having a uniform family of circuits composed entirely of classical- 
input-controlled CNot gates, in like fashion to the DQC1 circuit 
definition. We can use this idea to show : 

Lemma 2 Class inclusion ; 0LC BQ1P. 

Suppose we had a uniform circuit family, the circuits being 
made up exclusively of CNot gates, (the bits of an input string 
x being used as an additional control over some or all of those 
CNot gates,) and suppose those circuits were intended to act on 
initial pure states of the form | 100. ..00). Suppose we were then 
to measure a given bit of the output. We could describe the 
process by writing 

C N (x) = | (0|i Tr [2M [ C/ N (x)|100...00> ] | 2 . (18) 
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Now suppose we were to make a dual family of circuits by 
taking every circuit from the original family and simply reversing 
the direction of each CNot gate therein, exchanging source qubit 
with target qubit, but retaining all of the classical controls on the 
gates from the bits in x. Write the action of this modified circuit 
as U\ x \{x). We intend that circuits of this dual family (U n ) n be 
applied (with the same control from the input string x,) on the 
DQC1 state given in equation ([2]). It is easy to see that if an 
original circuit U\ x \(x) would have mapped its pure starting state 
| 100. ..00) to | s\S2---s w ), say, where s% = 1 — C\ x \(x), then the 

dual circuit U\ x \{x) would map its mixed starting state p start 
to the state (1 + Z\ x Z s 2 2 ...Z^») /2 W . This is because whereas 
Ai(Xj) ■ | a)i\ b)j = \ a)i\a + b)j, we have things the other way 
around in DQC1 notation : 

Ai(Xj) ■ Zf ■ Z) ■ Ai(Xj) = Z^ h -Z). (19) 

Consider next the somewhat larger DQC1 circuit composed 
as Ul ■ X x ■ U n . For a given control string x, this will map the 
state p start to 1+ £l Z 1 , where = 2C\ x \(x) — 1, as required, as 
the reader may readily check. Thus for such circuit families, 
we can guarantee determinism in the final measurement, and so 
certainly a BQ1P language is issued. This shows that ®L C 
BQ1P. ■ 



More pure qubits 

The next thing to show is that the complexity class BQ1P re- 
mains the same if we allow a constant number of pure qubits 
instead on just one, or even a logarithmic number. 

To be precise, suppose we generalise the model by having c 
pure qubits and being allowed, after arbitrary unitary processing, 
to measure just whether the first d qubits are all in the state | ). 
For each circuit U{x) we define f3 c ,d accordingly, (c/. eqn (jlj)) : 



l + Z)® c t (1 + Z) 



2 d 



■ (20) 



There would then be no point in having d — c > 0(log |x|), 
since the signal strength would drop off superpolynomially for all 
U(x), and so there would be no way to satisfy the requirement 
of equation (llip anywhere. If we were to have c ~ d ~ O(logn) 
then equation (|20|) can be expanded to be the scaled sum of 
the traces of polynomially many unitary operators, and thus, we 
shall show, the action of the machine could be simulated in the 
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BQ1P model. There are essentially two ways to complete the 
simulation in this case, either by parallel application of polyno- 
mially many simulators followed by some classical accounting at 
the end, or by randomly choosing one of those unitary operators 
to run the simulation on, to obtain a (3 corresponding to the av- 
erage trace. The first technique is not apparently applicable to 
BQ1P methodology, because as we have already noted, there is 
no real notion of parallel composition in DQC1. But the second 
option can be simulated in BQ1P, and though it may provide 
a polynomially weaker signal, it won't be so weak as to violate 
the guarantee required by equation (fTT]) . 

Let us illustrate this simulation in more detail for the im- 
portant case of c = 2, d = 1, (using the temporary notation 
"BQ2P" to indicate the class generated from allowing for two 
pure qubits instead of one) : 

Lemma 3 Class equality : BQ2P = BQ1P 

The probability that we wish to sample is given by 



1 + /Ma0 



Tr 



(1 + Z 1 )(1 + Z 2 ) __, , t (1 + Z 



U(x) • ^ • U(x 



>i 



TrjUZiUtZi] Tr[UZ 2 U^Z 1 ] Tr[UZ 1 Z 2 U 1 ^ ZjJ 
J- 2 W ' 2 W 2 W 

2 + 2 ' 

(21) 

where U (and hence (3 2 ,i) depends on the input string, x. Now 
there is a DQC1 circuit of width w + 2s that gets very close to 
sampling the probability 1+ ^|' 1 ^ 3 , namely 

U(x) ■ Ai jtu+ 2s(^2) • A.2 ;W+ 2s-l(Xi) ■ ■ ■ Ai ;U)+ 2(^2) ' ^■2,v)+l(^l)- 

(22) 

This approximation works 'exponentially well' because the Tof- 
foli gates that source the extra 2s qubits are sourcing totally 
random qubits, and so they reduce to random CNot gates, which 
establish a rapid-mixing Markov process for simulating the al- 
ternative starting signal; this being an almost-uniform mix from 
the set {Zi, Z2, Z1Z2} instead of the usual Z\. So if there were 
a 'BQ2P language' according to the semantic guarantee 

IfeiMI > (23) 

then the same language would be in BQ1P according to the 
semantic guarantee 

SiM " <24) 
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as the reader may easily check by considering the effect of the 
random Toffoli gates of the circuit in equation (|22p on the start- 
ing state. Recall that q is polynomially bounded, so s need only 
be logarithmic. ■ 
This illustration will readily generalise to show that class 
BQ1P is not extended by increasing the number of pure in- 
put bits to a logarithmic amount, (nor indeed by increasing the 
number of output bits by any amount, under the measurement 
assumptions we made.) 

Reduction 

We are now in a position to show that : 

Lemma 4 Class reduction : BQ1P is closed under reduction 
by logspace deterministic processes. 

This is the usual notion of reduction employed in complexity 
theory, [2]. It will in turn justify our earlier assumptions in 
section [2] relating to reduction concepts. 

The idea is to use lemmas [2] and El so that if £ £ BQ1P 
and R is a function whose every bit is in (J) L, (which is a more 
general thing than merely saying that R is logspace determinis- 
tic computable,) then we can prove the 'pre-reduction' language 
£ = { x : R(x) e C } to be in BQ2P', and hence in BQ1P. 

So proceed by supposing that we do have the use of two pure 
qubits rather than just one, and then divide the available qubits 
into two partitions with one pure qubit in each partition. The 
first partition will be used to undergo the transformations of the 
circuit being implemented, while the second partition will be 
used to compute the bits of R as they are needed. Each time 
bit Ri(x) is needed to choose between unitaries for applying to 
the first partition, use the techniques described for lemma [2] to 
map the second partition from state /9 start = (1 + Z\)/2 W to 
state (1 + (-1) R ^Z 1 )/2 W . Then apply the unitary on the first 
partition controlled on the pure qubit in the second partition, 
before 'uncomputing' the computed control bit to restore the 
second partition to its original p start value. (The partitions do 
not become entangled nor even classically correlated, because 
the control is always on a bit known to be pure and in the com- 
putational basis.) Apply lemma [3] to finish the proof. ■ 

The closure under logspace reduction, stability under small 
pertubations to the model, and the various completeness prop- 
erties, give us some reassurance that the class has been defined 
correctly in a 'natural' way. 
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4 Computational complexity conjectures 

We subscribe to the school of thought which assumes that each 
of BPP ^ BQP and L 7^ P is true but too difficult to prove. 
Consistent with this belief, here are some other conjectures : 

Conjecture 1 P % BQ1P 

Conjecture 2 BQ1P % BPP 

Conjecture 3 BQP % BPP BQ1P : No hybrid machine con- 
sisting of classical processors and quantum-uncorrelated DQC1 
processors can convincingly simulate the action of a quantum 
Turing machine within polynomial time, even allowing for use 
of adaptive techniques ( classical correlations.) 

Clearly these conjectures are too strong to hope to prove affir- 
matively, given the belief expressed above. For example, a proof 
of conjecture[T]would immediately establish that ® L (and hence 
also L) is a proper subset of P, while a proof of conjecture [2] or [3] 
would confirm our belief that quantum computation is indeed 
superpolynomially more powerful than its classical counterpart. 



ion is divided as to whether it is even likely, let alone provable. 
A popular approach is to conjecture that BQ1P = BQP, and 
so one should make a serious attempt to look for evidence to 
resolve this discrepancy. Evidence for conjecture [2] can be found 
in section [2j since the problem of Trace Estimation appears to 
be classically hard. Conjecture [3] is a bit more speculative and 
intuitive, capturing our opinion that having access to many pure 
qubits in the same processor really is a valuable resource. 

Evidence by relativisation 

Our main reason for finding conjectures Q] and [2] plausible has 
to do with relativisation, using oracles. The standard model for 
oracles fits into the DQC1 model only by representing a classical 
oracle as an (unknown) non-uniform class of permutation matri- 
ces which we may use as building blocks within the circuits in 
a circuit family but which we may not modify nor examine any 
description ofO The idea is that the complexity of computation 

5 Ok, my methods aren't always especially scientific, and I'm not admitting in print 
just how many experts confessed a strong opinion on this matter. 

6 We assume that we are allowed to implement certain derivatives within the circuit, 
such as "controlled- U" , a polynomial number of times if necessary. 
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represented by that oracle can be utilised 'atomically' within 
the circuitry, despite the fact that DQC1 doesn't allow for direct 
concatenation of computations, nor 'inputs and outputs' in the 
traditional sense, (as discussed in the first section.) 

Let U refer to such a (2 u, -by-2 w ) matrix. Then it is clear that 
while P circuitry relativised to U can report back any entry of the 
matrix, the best that BQ1P circuitry can do (see section[5]) is to 
estimate (3 = Tr[Vjj]/2' w , where Vjj is some polynomial- length 
unitary 'word' in the algebra A w >, some of whose 'letters' may be 
the unknown U. That there is a definite separation here can be 
shown inductively. Let U be a unitary matrix with a ( J 5 ) in the 
top left corner and let U' be the same matrix but with the top left 
corner replaced by ( 5 J ), i.e. U' = U — 2| — }( — |. Whereas P 
circuitry is able to distinguish between these two oracles, we shall 
show that the difference Tr[Vu] — Tr[Vu'] cannot be significant, 
i.e. cannot be on the order 2 W /poly(w). First consider the case 
w' = w and use induction on the 'word-length index' t found in 
the expression Vu,t = AfU • A t -\ ■ U • ■ ■ U ■ A\. Now define 

Xt ■= max | Tr[ W ■ (V Ujt - V v > tt ) } |, W unitary. (25) 
w 

From here we see 

| Tr[ Vu, t } - Tr[ V v >, t } \ (26) 

< Xt 

= | Tr[ W ■ A t ■ (U ■ Vu,t-i - V • V v , tt -i) ] | 

= | Tr[ (WAtU^Vu^ - tfr',t-i) ] + 2( - \V ultt -iWA t \ - ) \ 

< Xt-i + 2, 

and so this difference between the traces cannot grow to the 
exponential required within polynomial word-length t. The ar- 
gument is very similar for w 1 > w, so we shall leave details to 
the reader. 

Therefore, if O is an oracle containing infinitely many (ran- 
domly chosen) examples of (U,U') pairs, then (almost surely) 
one can construct deterministic distinguishing algorithms in P 
that have no analogue in BQ1P, and hence an oracle-based de- 
cision language in the former that cannot be contained in the 
latter, so P° % BQ1P . 

For conjecture El consider taking U to be a random 2 w -by- 
2 W permutation matrix, implementing the permutation ir on 
GF(2) W , subject to a certain promise (to be specified shortly,) 
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and let V = H® w ■ U ■ H® w ■ U ■ H® w ■ U. Then 

2~5w/2 V^ _^Qt'7r(fc) © fc-7r(m) © m--w(i) (27) 
ikm 

2 w/2 ( 1 - 2 E[ i ■ ir(k) © k ■ ir(m) © m ■ tt(») ] ), 

and provided we have the promise that this value is sufficiently 
bounded away from 0, it is easy for BQ1P to resolve whether the 
value is closer to 1 or —1, but clearly any (randomised) classical 
device would have to query U at exponentially many places to 
establish that distinction. Therefore any O containing infinitely 
many (randomly chosen) examples of such U matrices will (al- 
most surely) exemplify a case where BQ1P % BPP . 
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